On the Perception of Transients: Applying Psychophysical Constraints to Improve Audio Analysis and Synthesis
نویسندگان
چکیده
Recent advances in audio representation decompose the signal into a parallel combination of sinusoidal, transient, and noise components and then apply specialized techniques to analyze and synthesize each of these components (Levine and Smith 1999). With respect to time-varying sinusoids and wideband noise, knowledge about auditory perception has been used to optimize the representation. In contrast, relatively less is known about the perception of transients so that the processing of transients is governed primarily by standard signal transform techniques. The present paper presents new psychophysical results on the perception of transients and suggests how these results may be used to improve the analysis and synthesis of transients in the audio signal. Limits on temporal resolution in auditory perception typically range from 25 ms, for the judgment of temporal order, to a lower limit of 1-2 ms for the discrimination of monaural phase (Eddins and Green 1995). For example, Ronken (1970) measured discrimination thresholds for a pair of unequal amplitude clicks and their time-reversed form as a function of the temporal separation between the clicks and the relative amplitude of the clicks. He demonstrated that the threshold amplitude was relatively constant for temporal separations greater than 2 ms, but that the threshold increased when decreasing the temporal separation from 2 to 1 ms. Patterson and Green (1970) studied the discrimination of Huffman sequences, which, for a given duration, share the attribute that they have identical power spectra. They found that discrimination between different Huffman sequences
منابع مشابه
An Investigation of the Linguistic, Paralinguistic and Sociocultural Effects of Input on the Perception and Translation of Gerunds by Persian Speakers of English
In this study, it was intended to investigate the Persian native speakers’ perception of gerunds by three different elicitation techniques i.e., written, audio, and pictorial through translation. Eighty intermediate learners of English were asked to select Persian translation of the gerund formsin these elicitation techniques. They were asked to choose one option from a pair of written first la...
متن کاملIdentifying the challenges to good clinical rounds: A focus-group study of medical teachers
Introduction: The use of clinical rounds, as an integral part ofclinical teaching to help medical students acquire essential skillsof practicing medicine, is critically important. An understandingof medical teachers’ perceptions concerning the challenges ofclinical rounds can help identify the key areas of focus to betterfoster professional development of medical students. This studyexplored th...
متن کاملGenerating Optimal Timetabling for Lecturers using Hybrid Fuzzy and Clustering Algorithms
UCTTP is a NP-hard problem, which must be performed for each semester frequently. The major technique in the presented approach would be analyzing data to resolve uncertainties of lecturers’ preferences and constraints within a department in order to obtain a ranking for each lecturer based on their requirements within a department where it is attempted to increase their satisfaction and develo...
متن کاملApplying Grey E-S-QUAL Model to Evaluate the Gaps between Expectation and Perception of the Customer Based on E-services Quality: A Case Study of an Iranian Online Retailer
This study aims to apply Grey system based on modified E-S-Qual model to analyze e-service quality.Questionnaires on the basis of E-S-Qual model, which consisted in 7 dimensions, were distributed among customers of 5040.ir, an online retailer in Iran. 251 questionnaires were obtainedfrom the customer’s website. After applying the method and calculating the scores in each dimension, the gap betwe...
متن کاملAn analysis/synthesis tool for transient signals that allows a flexible sines+transients+noise model for audio
We present a flexible analysis/synthesis tool for transient signals that extends current sinusoidal and sines+noise models for audio to sines+transients+noise. The explicit handling of transients provides a more realistic and robust signal model. Because the transient model presented is the frequency domain dual to sinusoidal modeling, it has similar flexibility and allows for a wide range of t...
متن کامل